Modelling binaural detection of speech stimuli in complex reverberant environments

نویسندگان

  • Tobias Weller
  • Jörg M. Buchholz
چکیده

In listening experiments that assess spatial hearing (e.g. localization) in the presence of background noise it is important to ensure audibility of the target stimulus. Target audibility could be easily controlled using knowledge about the masked thresholds in the respective paradigm. However, if a large number of acoustic conditions as well as target sounds are considered, individual psychoacoutic threshold measurements are not feasible and alternative methods need to be applied. In this study a binaural auditory model is applied to predict masked thresholds for wide-band non-stationary stimuli. This model closely follows the approach of Hant and Alwan (Speech Communication, 2003, Vol. 40, pp. 291-313) [7]. In the first stage of the model an auditory front-end generates an internal representation of the stimuli in both ears. The auditory front-end includes head-related transfer functions, auditory bandpass filtering, squaring, temporal integration, logarithmic compression and additive internal noise. In the second stage, a decision device combines d′ information across time, frequency and ears and provides an estimate of the masked threshold. Three different methods of combining information across ears were compared: A better-ear approach, a cross-ear glimpsing approach and an approach using binaural integration. The model was verified using psychoacoustic masked threshold data of a detection paradigm that considered a target word presented from 15 different locations in a reverberant multi-talker background. The model predictions were in very good agreement with the measured masked thresholds. It is concluded that this model is suitable to control audibility of a target stimulus in a complex reverberant environment. Moreover, the model was used to systematically analyze the effect of head-shadow, signal spectrum, auditory sensitivity and room reverberation on target audibility.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Binaural deep neural network classification for reverberant speech segregation

While human listening is robust in complex auditory scenes, current speech segregation algorithms do not perform well in noisy and reverberant environments. This paper addresses the robustness in binaural speech segregation by employing binary classification based on deep neural networks (DNNs). We systematically examine DNN generalization to untrained configurations. Evaluations and comparison...

متن کامل

Robust speech recognition in reverberant environments using subband-based steady-state monaural and binaural suppression

The precedence effect describes the ability of the auditory system to suppress the later-arriving components of sound in a reverberant environment, maintaining the perceived arrival azimuth of a sound in the direction of the actual source, even though later reverberant components may arrive from other directions. It is also widely believed that precedence-like processing can also improve speech...

متن کامل

Binaural Source Separation in Non-ideal Reverberant Environments

This paper proposes a framework for separating several speech sources in non-ideal, reverberant environments. A movable human dummy head residing in a normal office room is used to model the conditions humans experience when listening to complex auditory scenes. Before the source separation takes place the human dummy head explores the auditory scene and extracts characteristics the same way as...

متن کامل

Binaural Reverberant Speech Separation Based on Deep Neural Networks

Supervised learning has exhibited great potential for speech separation in recent years. In this paper, we focus on separating target speech in reverberant conditions from binaural inputs using supervised learning. Specifically, deep neural network (DNN) is constructed to map from both spectral and spatial features to a training target. For spectral features extraction, we first convert binaura...

متن کامل

Recent advances in fragment-based speech recognition in reverberant multisource environments

This paper addresses the problem of speech recognition using distant binaural microphones in reverberant multisource noise conditions. Our scheme employs a two stage fragment decoding approach: first spectro-temporal acoustic source fragments are identified using signal level cues, and second, a hypothesisdriven stage simultaneously searches for the most probable speech/background fragment labe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014